SEQUENCE LISTING 



(1) GENERAL INFORMATION: / 

(i) APPLICANT: Wallis, Nicola 4. 

Burnham, Martin K. R. 

(ii) TITLE OF INVENTION: mur(C 

(iii) NUMBER OF SEQUENCES^ 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Deche/t, Price & Rhoads 

(B) STREET: 4000 BeA Atlantic Tower, 1717 Arch Stre 

(C) CITY: Philadelphia 

(D) STATE: PA / 

(E) COUNTRY: USA/ 

(F) ZIP: 19103-^793 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER/ IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

<vi) CURRENff APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PNIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/052,720 

(B) HI LING DATE: 03-JUL-1997 



(viiii) ATTORNEY/AGENT INFORMATION: 
(A/ NAME: Falk, Stephen T 
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(B) REGISTRATION NUMBER: 36,795 / 

(C) REFERENCE /DOCKET NUMBER: Gm/o025 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-994-2488 / 

(B) TELEFAX: 215-994-2222 / 

(C) TELEX: / 

(2) INFORMATION FOR SfQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 bale pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : /double 

(D) TOPOLOGY: linfear 



/ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

ATGAGTAAGG AGTTTTATAF AATGACACAC TATCATTTTG TCGGAATTAA AGGTTCTGGC 60 

(atgagttcat TAGCACA2AT CATGCATGAT TTAGGACATG AAGTTCAAGG ATCGGATATT 120 

GAGAACTACG TAT TT AC AG A AGTTGCTCTT AGAAATAAGG GGATAAAAAT ATTACCATTT 180 

GGTGCTAATA ACATAAAAGA AGATATGGTA GTTATACAAG GTAATGCATT CGCGAGTAGC 24 0 

CATGAAGAAA TAGTAGfGTGC ACATCAATTG AAATTAGATG TTGTAAGTTA TAATGATTTT 300 

TTAGGACAGA TTATyGATCA ATATACTTCA GTAGCTGTAA CTGGTGCACA TGGTAAAACT 360 

TCTACAACAG GTTTJATTATC ACATGTTATG AATGGTGATA AAAAGACTTC ATTTTTAATT 420 

GGTGATGGCA CAGGTATGGG ATTGCCTGAA AG T GAT T ATT TCGCTTTTGA GGCATGTGAA 4 80 

TATAGACGTC ACffTTTTAAG TTATAAACCT GATTACGCAA T TAT G AC AAA TATTGATTTC 54 0 

GATCATCCTG ATTATTTCAA AGATATTAAT GATGTTTTTG ATGCATTCCA AGAAATGGCA 600 

CATAATGTTA AAAAAGGTAT TATTGCTTGG GGTGATGATG AACATCTACG TAAAATTGAA 660 

GCAGATGTTC /CAATTTATTA CTATGGATTT AAAGATTCGG ATGACATTTA TGCTCAAAAT 720 

ATTCAAATTi/ CGGATAAAGG TACTGCTTTT GATGTGTATG TGGATGGTGA GTTTTATGAT 780 

CACTTCCTGT CTCCACAATA TGGTGACCAT ACAGTTTTAA ATGCATTAGC TGTAATTGCG 840 

AT TAG TT ATI TAGAGAAGCT AGATGTTACA AATATTAAAG AAGCATTAGA AACGTTTGGT 900 

GGTGTTAAff^C GTCGTTTCAA TGAAACTACA ATTGCAAATC AAGTTATTGT AGATGATTAT 960 

GCACACCATC CAAGAGAAAT TAGTGCTACA AT TG AC AC AG CACGAAAGAA ATATCCACAT 1020 

AAAGAAffiTTG TTGCAGTATT TCAACCACAC ACTTTCTCTA GAACACAAGC ATTTTTAAAT 1080 

GAATTTTGCAG AAAGTTTATG TAAAGCAGAT CGTGTATTCT TATGTGAAAT TTTTGGCTCA 114 0 

ATTAGAGAAA ATTCTGGCGC ATTAACGATA CAAGATTTAA TTGATAAAAT TGGAGGTGCA 1200 

TCGT7CATTA ATGAAGATCT TATTAATGTA TTAGAACAAT TTGATAATGC TGTTGTTTTA 1260 
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TTTATGGGTG CAGGTGATAT TCAAAAATTA CAAAATGCAT ATTTAGATAA ATTAGGCATG / 1320 
AAAAATGCGT TTTAATATGT TTATAATAGA G / 1351 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2/ 



His Phe Val 
5 

Met His Asp 



Gly He 

Leu Gly 

25 
Glu V< 
40 

AsA Asn 
Ser Ser 




Lys Lys Gly 
Glu Ala Asp 



Lys Gly Ser Gly Met Ser Ser 
10 / 15 
_s Glu Val Gin Gly Ser Asp 
30 

Ala Leu Arg Asn Lys Gly He 
45 

He Lys Glu Asp Met Val Val 
60 

His Glu Glu He Val Arg Ala 

75 80 
Tyr Asn Asp Phe Leu Gly Gin 
90 95 
Val Thr Gly Ala His Gly Lys 
110 

Val Met Asn Gly Asp Lys Lys 
125 

Gly Met Gly Leu Pro Glu Ser 
140 

Tyr Arg Arg His Phe Leu Ser 
155 160 
Asn He Asp Phe Asp His Pro 
170 175 
Phe Asp Ala Phe Gin Glu Met 
190 

Ala Trp Gly Asp Asp Glu His 
205 

He Tyr Tyr Tyr Gly Phe Lys 
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Phie Asp Asn 

Leu Gin Asn 
425 



Ala Asp Arg 

Ser Gly Ala 
380 

Ser Phe lie 

395 
Ala Val Val 
410 

Ala Tyr Leu 



Thr Asp Lys Qly 
240 

Asp His Ptfe Leu 
^255 

Leu hYa Val lie 

170 

lid Lys Glu Ala 
15 

Glu Thr Thr He 

Pro Arg Glu He 
320 

His Lys Glu Val 
335 

Gin Ala Phe Leu 
350 

Val Phe Leu Cys 
365 

Leu Thr He Gin 

Asn Glu Asp Leu 
400 

Leu Phe Met Gly 
415 

Asp Lys Leu Gly 
430 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) MENGTH: 660 base pairs 

(B) /tyPE: nucleic acid 
(c/ STRANDEDNESS: double 
(B) TOPOLOGY: linear 



/(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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ATTTAAAGAT 


T C GG AT GAC A 


TTTATGPTPA 

XXX xi. X vJ X \_^xi 


AATATTTCAA 

xIaa x *i xxx wiin 


ATTACGGATA 


AAGGTACTGC 


/ 60 


X O J. 1 Ort. lulu 


TATGTGGATG 


o i unvj x x x x iri 


TGATPAPTTP 
x on x x x \^ 


PTGTPTPPAP 


AATATGGTGA / 


120 


PPATAPAGTT 


TTAAATGPAT 


TAGPTGTAAT 
x nuu l u i nn. x 


TGPGATTAGT 

x u^uzi x inu x 


TATTTAGAGA 

x ii xxx nuiivjn 


AGCTAGATQT 


180 


TAPAAATATT 

x nvnrui x a x x 


AAAGAAGPAT 


TAGAAAPGTT 


TGGTGGTGTT 

X \J\J X NwJ \JJ X \J X X 


AAAPGTCGTT 


TCAATG^AC 


240 


TAPAATTGPA 


AATPAAGTTA 


TTGTAGATGA 

X X \_J X ii\Jii X \Jii 


TTATGPAPAP 

X X X vJ viiviiw 


CATCCAAGAG 


AAAT'EAGTGC 


300 


TAPAATTGAP 


ACAGCACGAA 


AGAAATATCC 


ACATAAAGAA 


GTTGTTGCAG 


TAT^TCAACC 


360 


ACACACTTTC 


TCTAGAACAC 


AAGCATTTTT 


AAATGAATTT 


G C AG AAAG T T 


I i\A(j J. AAAbL 


/ion 


AGATCGTGTA 


TTCTTATGTG 


AAATTTTTGG 


ATCAATTAGA 


GAAAAT AC T G, 


/GCGCATTAAC 


480 


GATACAAGAT 


TTAATTGATA 


AAATTGAAGG 


TGCATCGTTA 


ATTAATGAAG 


ATTCTATTAA 


540 


TGTATTAGAA 


CAATTTGATA 


ATGCTGTTGT 


TTTATTTATG 


GGTGCAGGTG 


ATATTCAAAA 


600 


ATTACAAAAT 


GCATATTTAG 


ATAAATTAGG 


CATGAAAAAT 


gcgtt/taat 


ATGTTTATAA 


660 



(2) INFORMATION FOR SEQ ID NO: 4; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Phe Lys Asp 
1 

Lys Gly Thr 



<xi) SEQUENCE DESCRIPTION? : SEQ ID NO: 4 
Ile/Tyr 
al Tyr 



Phe Leu Ser 
35 

Val lie Ala 
50 

Glu Ala Leu 
65 

Thr He Ala 

Glu He Se/r 

Glu Val/Val 
115 
Phe L&u Asn 

[30 



Ser Asp Asp 
5 

Ala Val Asp 
20 

Pro Gin T>Vr 



Gli/ Thr Phe 
70 

lsii Gin Val 
85 

Ala Thr He 
100 

Ala Val Phe 

Glu Phe Ala 



Gly Asp 

40 
Leu Glu 
55 

Gly Gly 

He Val 

Asp Thr 

Gin Pro 
120 
Glu Ser 
135 



Ala Gin He 
10 

Val Asp Gly 
25 

His Thr Val 



Lys Leu Asp 

Val Lys Arg 
75 

Asp Asp Tyr 
90 

Ala Arg Lys 
105 

His Thr Phe 
Leu Ser Lys 



Phe 

Glu 

Leu 

Val 

60 

Arg 

Ala 

Lys 

Ser 

Ala 
140 



Gin He Thr Asp 
15 

Phe Tyr Asp His 
30 

Asn Ala Leu Ala 
45 

Thr Asn He Lys 

Phe Asn Glu Thr 
80 

His His Pro Arg 
95 

Tyr Pro His Lys 
110 

Arg Thr Gin Ala 
125 

Asp Arg Val Phe 
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Leu Cys Glu lie Phe Gly Ser lie Arg Glu Asn Thr Gly Ala Leu Thi 
145 150 155 yio 

lie Gin Asp Leu lie Asp Lys lie Glu Gly Ala Ser Leu lie Asyf Glu 

165 170 y/lb 

Asp Ser lie Asn Val Leu Glu Gin Phe Asp Asn Ala Val Valr Leu Phe 

180 • 185 /90 

Met Gly Ala Gly Asp lie Gin Lys Leu Gin Asn Ala Ty^ Leu Asp Lys 

195 200 2^05 

Leu Gly Met Lys Asn Ala Phe 
210 215 

(2) INFORMATION FOR SEQ ID NO:^ 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sing/e 

(D) TOPOLOGY: linear/ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTTCATTAAT GAACGATGC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) ^LENGTH: 19 base pairs 
(b/ TYPE: nucleic acid 

;) STRANDEDNESS: single 
r (D) TOPOLOGY: linear 



19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GTmCAAATA TTAAAGAAG 



19 
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